f».  ®is  •■}••()  , 


I  ,  lament  hot  biS 

I  ^LSHbHG  nhtm  «*  * 

UNITED  STATES  AIR  FORCE  Ujgggga  *■ 

AIR  UNIVERSITY  **”* 

AIR  FORCE  INSTITUTE  OF  TECHNOLOGY 

Wright-Patterson  Air  Force  Base, Ohio 


AFIT/GOR/SM/7  9D- 4 


^AIRCRAFT  £IRFRAME  £OST  ^STIMATION 


i 


Approved  for  public  release;  distribution  unlimited 


SECURITY  CLASSIFICATION  Of  THIS  PAGE  (Whin  Pete  Entered) 

I  REPORT  DOCUMENTATION  PAGE 


REPORT  HUMBER 


]2.  GOVT  ACCESSION  NO. 


AFIT/GOR/SM/79D-4 


4.  TITLE  (end  Subtitle) 

AIRCRAFT  AIRFRAME  COST  ESTIMATION 
USING  A  RANDOM  COEFFICIENTS  MODEL 


READ  INSTRUCTIONS 

_ BEFORE  COMPLETING  FORM 

3.  RECIPIENT'S  CATALOG  NUMBER 


5.  TYPE  OF  REPORT  &  PERIOD  COVEREO 


AFIT  Thesis 


«.  PERFORMING  ORG.  REPORT  NUMBER 


7.  AUTHORS) 

James  H.  Hinch 
Captain  USAF 

9  PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 

Air  Force  Institute  of  Technology 
AFIT/EN 

Wright-Patterson  Air  Force  Base,  OH  45433 

It.  CONTROLLING  OFFICE  NAME  AND  ADDRESS 


8.  CONTRACT  OR  GRANT  NUMBERf*) 


10.  PROGRAM  ELEMENT,  PROJECT.  TASK 
AREA  4  WORK  UNIT  NUMBERS 


IZ.  REPORT  DATE 


December  1979 

13.  NUMBER  OF  PAGES 


4.  MONITORING  AGENCY  NAME  4  ADDRESS^//  different  from  Controlling  Office)  I  15.  SECURITY  CLASS,  (of  thie  "port) 


15a.  DECLASSI  FI  CATION/' DOWN  GRADING 
SCHEDULE 


116.  DISTRIBUTION  STATEMENT  (ol  thli  Report) 


Approved  for  public  release;  distribution  unlimited.  Appendices  A 
and  B  are  limited  to  U.S.  Government  agencies  only;  proprietary 
information;  December  1979.  Other  requests  for  Appendices  A  and  B 
of  this  document  must  be  referred  to  Dean  of  Engineering,  Air 
Force  Institute  of  Technology  (AFIT/EN) ,  Wright-Patterson  Air _ 


17.  DISTRIBUTION  STATEMENT  (ol  (ha  ebelrect  entered  In  Block  30,  II  dlllerent  Iron,  Report) 


|  JOSEPQ.  HIPPST,  Major/  USAF 
Director  of  Public  Affairs 

18.  SUPPLEMENTARY  NOTES 


10  Dec  79 


1 19.  KEY  WORDS  (Continue  on  reveree  eide  if heceeeiry  end  Identify  by  btock  number) 


Parametric  Cost  Estimation 
Aircraft  Airframe 
Random  Coefficients  Model 


Parametric  Airframe  Cost  Estimation 
Airframe  Cost  Estimation 
Aircraft  Cost  Estimation 


20.  ABSTRACT  (Continue  on  reveree  aide  It  neceeeery  end  /d©nf//y  by  block  number) 

/^Previous  studies  into  aircraft  airframe  acquisition  costs  have 
either  not  dealt  with  the  learning  phenomenon  or  have  assumed  that 
the  learning  curve  slope  is  the  same  for  all  types  of  aircraft. 
However,  some  results  have  indicated  that  this  is  not  truly  the 
case.  The  random  coefficients  model,  as  applied  in  this  study, 
provides  a  framework  in  which  the  slopes  can  differ  by  estimating 
their  values  based  on  other  characteristics  of  the  aircraft.  _ _ ^ 


on  roRM 

uu  !  JAN  73 


EDITION  OF  1  NOV  65  IS  OBSOLETE 


SECURITY  CLASSIFICATION  OF  THIS  PAGE  (Wian  Dot*  Entered) 


SECURITY  CLASSIFICATION  OF  THIS  PAOEQWnn  Dmt*  Enter'd) 


16.  (Continued) 

Force  Base,  OH  45433. 


20.  (Continued) 

, 

^  This  method  thus  has  the  advantage  over  other  techniques 
of  being  able  to  predict  the  actual  learning  curve  slope 


more  accurately  before  any  production  takes  place,  and 
thus  yield  more  reliable  cost  estimates.  For  that  reason, 
it  represents  a  significant  advance  in  the  state  of  the 
art  of  parametric  airframe  cost  estimation y/ 


AFIT/GOR/SM/79D-4 


AIRCRAFT  AIRFRAME  COST  ESTIMATION 
USING  A  RANDOM  COEFFICIENTS  MODEL 


THESIS 


Presented  to  the  Faculty  of  the  School  of  Engineering 
of  the  Air  Force  Institute  of  Technology 
Air  University 

In  Partial  Fulfillment  of  the 
Requirements  for  the  Degree  of 
Master  of  Science 


by 

James  H.  Hinch 
Captain  USAF 

Graduate  Operations  Research 
December  1979 


Approved  for  public  release;  distribution  unlimited 


Preface 


The  possibility  of  differing  learning  curve  slopes  for 
different  types  of  aircraft  has  not  been  explicitly 
addressed  in  previous  aircraft  airframe  cost  estimation 
studies.  Previous  results  indicate  that  the  learning  curve 
slopes  do  differ  significantly.  The  random  coefficients 
model  provides  one  method  of  quantifying  this  difference. 
The  Introduction,  Chapter  I,  the  Data,  Chapter  II,  and  the 
Conclusions,  Chapter  V,  discuss  the  overall  project,  the 
data  base,  and  the  findings  of  this  study  in  non-mathemati- 
cal  terms.  The  remaining  chapters  present  the  model  devel¬ 
opment  and  results. 

I  extend  my  deepest  thanks  to  Lieutenant  Colonel 
McNicolls,  my  thesis  advisor,  for  his  continuous  help,  and 
to  Dr.  N.  Keith  Corner,  who  is  primarily  responsible  for 
this  study's  theoretical  development.  I  must  also  thank  my 
wife,  Alda,  for  her  great  understanding  and  encouragement. 
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Previous  studies  into  aircraft  airframe  acquisition 
costs  have  either  not  dealt  with  the  learning  phenomenon  or 
have  assumed  that  the  learning  curve  slope  is  the  same  for 
all  types  of  aircraft.  However,  some  results  have  indicated 
that  this  is  not  truly  the  case.  The  random  coefficients 
model,  as  applied  in  this  study,  provides  a  framework  in 
which  the  slopes  can  differ  by  estimating  their  values 
based  on  other  characteristics  of  the  aircraft.  This  method 
thus  has  the  advantage  over  other  techniques  of  being  able 
to  predict  the  actual  learning  curve  slope  more  accurately 
before  any  production  takes  place,  and  thus  yield  more 
reliable  cost  estimates.  For  that  reason,  it  represents  a 
significant  advance  in  the  state  of  the  art  of  parametric 
airframe  cost  estimation. 


AIRCRAFT  AIRFRAME  COST  ESTIMATION  USING 


A  RANDOM  COEFFICIENTS  MODEL 

I.  Introduction 

Any  item  up  for  sale  has  both  a  price  and  a  value  for 
a  potential  buyer.  In  order  for  the  sale  to  take  place, 
the  buyer  must  believe  that  the  item's  value  is  at  least  as 
great  as  its  price.  But  in  some  cases  the  value  and  price 
are  not  known  precisely.  For  relatively  inexpensive  items 
this  uncertainty  might  not  be  important,  but  as  price 
increases  the  buyer  becomes  more  and  more  interested  in 
reducing  his  risk.  For  very  large  purchases  he  is  extremely 
interested  in  improving  the  accuracy  of  cost  estimates,  so 
that  he  can  realistically  evaluate  alternatives. 

One  very  costly  item  for  DoD  is  aircraft.  Aircraft 
purchase  contracts  run  into  billions  of  dollars  and  make  up 
a  significant  part  of  the  defense  budget.  Cost  estimates 
for  aircraft  can  help  managers  make  better  decisions,  but 
these  estimates  can  be  inaccurate  for  two  primary  reasons. 
First,  if  any  aspect  of  the  aircraft  itself  is  changed,  the 
cost  estimate  might  be  invalidated  to  a  certain  degree. 
Second,  all  cost  estimation  techniques  contain  some  inher¬ 
ent  amount  of  error.  It  is  this  latter  error  that  analysts 
have  been  continually  trying  to  reduce  through  improved 
estimation  methods. 

In  the  last  fifteen  years  the  parametric  statistical 
method  of  generating  cost  estimating  relationships  (CER's) 
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has  enjoyed  increased  popularity.  Under  this  approach, 
historical  data  is  used  to  find  a  relationship  between 

chosen  parameters  of  the  item  and  its  cost.  The  same  rela¬ 
tionship  is  then  assumed  to  hold  for  future  purchases  of  a 
similar  item,  so  that  its  cost  can  be  estimated  from  its 
known  parameters. 

This  paper  will  develop  a  CER  to  predict  aircraft 
aiframe  acquisition  costs  for  fighter  aircraft.  An  air¬ 
frame  is  the  basic  structure  of  an  aircraft  minus  such 
things  as  wheels,  engines,  and  electronics.  Aircraft  are 
purchased  in  "lots"  from  the  selected  company.  Lots  may 
vary  in  size,  but  are  normally  one  year's  buy. 

Cost  Elements 

The  cost  of  a  lot  can  be  broken  down  into  more  basic 
components  of  tooling  cost,  engineering  cost,  material  cost 
and  labor  cost.  Each  component  is  called  a  cost  element. 
The  parametric  statistical  method  can  thus  be  applied  in 
two  different  ways:  (1)  it  can  be  used  to  estimate  the 
lot  cost  directly,  or  (2)  separate  CER's  can  be  found  for 
each  cost  element  and  the  results  summed  to  obtain  the 
total  cost.  A  RAND  Corporation  study  found  that  neither 
technique  has  significantly  greater  accuracy  (Ref  8:3-6). 
Since  this  paper  is  based  on  previous  reports  which  used 
the  cost  element  approach,  and  because  it  provides  managers 
with  more  information  about  the  source  of  costs,  that 
approach  will  also  be  used  here. 


Model  Formulation 

Several  different  types  of  equations  could  be  used  for 
the  basic  form  of  the  CER,  such  as  linear  (Y»A+BX+e) ,  expo- 
nential  (Y»AX  +e) ,  and  log-linear  (Y=AX  e) .  In  these  equa¬ 
tions,  Y  represents  the  cost  element,  X  represents  the  set 
of  explanatory  variables,  A  and  B  represent  unknown  con¬ 
stants  to  be  .found  using  statistical  techniques,  and  e  is 
an  error  term.  Of  these  forms  the  linear  is  the  simplest 
to  use,  but  it  does  not  fit  airframe  data  as  well  as  the 
other  forms.  Because  the  right  hand  side  of  the  exponential 
form  contains  an  additive  term,  a  multiplicative  term,  and 
an  exponential  term,  it  cannot  readily  be  transformed  into 
a  linear  equation.  Furthermore,  the  fact  that  the  error 
term  is  additive  assumes  that  the  error  is  of  the  same  mag¬ 
nitude  regardless  of  the  size  of  Y.  This  does  not  make 
much  sense  for  airframe  cost  data.  The  log-linear  form  does 
fit  the  data  quite  well,  and  it  can  be  linearized  to  facili¬ 
tate  calculations  by  taking  natural  logarithms  of  both  sides 
of  the  equation  (In  Y  =  In  A  +  B  In  X  +  In  e) .  For  these 
reasons  the  log-linear  form  is  the  one  most  commonly  used 
in  CER' s. 

Historical  data  can  also  be  applied  to  the  model  in 
two  different  ways.  First,  each  lot  purchase  can  be  treated 
as  an  individual  data  point  usiny  cumulative  quantities  and 
costs .  The  RAND  report  by  Timson  and  Tihanrky  (Ref  8)  is 
an  example  of  this  type  of  data  formulation.  One  problem 
with  this  method  is  the  correlation  between  data  points  for 
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the  same  aircraft  type.  A  second  formulation  is  to  modify 
the  data  to  produce  only  one  data  point  for  each  aircraft 
type.  This  eliminates  the  correlation  problem,  but  also 
reduces  the  size  of  the  available  data  base,  thus  ignoring 
some  of  the  available  information.  The  RAND  report  by 
Large,  Campbell,  and  Cates  (Ref  4)  uses  this  second  method 
of  data  formulation. 

Estimating  the  unknown  coefficients  can  then  be  done 
by  the  technique  of  "least  squares"  (minimizing  the  sum  of 
the  squares  of  the  residuals  between  the  actual  data  and 
predicted  values) .  A  basic  assumption  of  this  technique 
is  that  the  error  term  (In  e,  for  the  log-linear  form) 
follows  a  random  normal  distribution  with  a  mean  of  zero 
(Ref  7:110-111).  Since  each  residual  is  simply  a  sample 
of  the  error  term,  a  plot  of  the  residuals  should  also  show 
a  random  normal  distribution. 

In  a  recent  thesis  (Ref  5)  Marcotte  developed  several 
CER's  for  fighter  aircraft  airframes  using  the  log-linear 
model.  A  plot  of  the  residuals  from  the  CER  for  recurring 
labor  hours  indicates  that  within  each  aircraft  type  the 
residuals  are  not  completely  random  (Fig.  1) ,  in  violation 
of  the  assumption.  In  fact,  there  appears  to  be  significant 
correlation  between  the  residuals  and  the  natural  log  of 
the  quantity  purchased  (In  Q) ,  which  was  one  of  the  explan¬ 
atory  variables.  In  other  words,  the  best  value  for  the 
coefficient  of  In  Q  appears  to  vary  between  different 
aircraft. 


The  above  suggests  the  use  of  a  random  coefficients 

model,  which  will  allow  the  coefficient  to  vary.  This 

paper  analyzes  the  data  used  by  Marcotte  using  the  random 

coefficients  model  discussed  by  Amemiya  (Ref  1) .  The 

u  .  d.  d_  d, 

basic  equation  is  Y  =  aFS  .  .Q^  u  where  Y  is  the 

cost  element,  W  is  airframe  empty  weight,  S  is  aircraft 
maximum  speed,  is  the  quantity  of  the  i'th  aircraft 
type,  u  is  the  error  term,  and  a,b,c,  and  all  of  the  d's 
are  unknown  constants.  In  the  actual  prediction  equation 
only  the  Q  for  the  aircraft  type  involved  would  be  used. 

Note  that  this  allows  each  aircraft  type  to  have  a  differ¬ 
ent  value  for  d,  as  desired.  A  second  equation  is  needed 
therefore  to  calculate  this  value,  D  =  RX  +  v  where  D  is 
the  matrix  of  d's,  X  is  the  matrix  of  explanatory  variables 
(aircraft  parameters),  R  is  the  matrix  of  coef ficients , 
and  v  is  a  second  error  term.  This  completes  the  model 
formulation  used  in  this  paper. 

Problem 

The  basic  problem  addressed  in  this  paper  was  the 
elimination  of  the  nonrandomness  that  occurred  in  Marcotte' s 
model.  The  same  basic  functional  form  (log-linear)  as  well 
as  the  same  data  (weight,  speed,  and  quantity)  was  used, 
except  that  additional  parameters  were  needed  to  explain 
the  observed  correlation.  With  that  correlation  accounted 
for,  the  result  should  be  a  CER  which  fits  all  model 
assumptions  and  yields  smaller  variance  than  previous 
techniques. 
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Procedure 


The  first  step  required  was  to  develop  a  theoretical 
technique  which  could  be  applied  to  the  basic  model 
structure  and  which  could  explain  the  observed  correlation 
Additional  techniques  were  then  applied  to  solve  for  the 
desired  coefficients  using  maximum  liklihood  equations. 

The  specific.  CER's  resulted  from  using  these  coefficients 
in  the  basic  model. 

The  second  step  was  to  test  several  aircraft  param¬ 
eters  to  find  one  or  more  which  minimized  the  variances 
while  eliminating  as  much  as  possible  of  the  nonrandomness 
observed  by  Marcotte.  Computer  facilities  were  required 
for  this  analysis.  The  final  result  was  CER's  that  should 
be  capable  of  more  accurate  cost  predictions  than  pre¬ 
viously  possible. 

Organization 

This  study  is  organized  in  five  chapters.  Chapter 
II  provides  further  information  about  the  data  base 
employed.  Chapter  III  uses  calculus,  matrix  theory,  and 
probability  theory  to  expand  the  basic  model  into  a  form 
which  can  be  applied  to  the  data  to  produce  the  desired 
CER's.  Chapter  IV  analyzes  the  computer  results  and  the 
final  CER's.  Finally,  chapter  V  contains  a  summary  and 
conclusions  of  this  study. 


II.  Data  and  Cost  Elements 


Scope  and  Limitations 

A  question  arises  as  to  what  data  to  include  in  the 
data  base.  From  a  statistical  viewpoint  it  is  desirable 
to  have  as  large  a  data  base  as  possible  in  producing  CER's 
in  order  to  increase  the  confidence  in  the  final  result. 
However/  it  is  also  desirable  for  the  data  to  be  as 
homogeneous  as  possible. 

Aircraft  are  produced  in  a  great  variety  of  sizes  and 
shapes/  and  for  different  purposes.  A  CER  that  is  good 
for  one  kind  might  not  be  completely  accurate  for  another. 
Thus,  it  is  possible  to  construct  the  data  base  in  two 
different  ways  (Ref  4:10-12).  First,  information  on  all 
kinds  of  aircraft  could  be  included  to  maximize  the  amount 
of  data  used.  This  method  is  generally  better  in  pre¬ 
dicting  costs  for  an  aircraft  where  few  similar  kinds  have 
previously  been  produced.  Second,  in  deriving  a  CER  for 
a  particular  aircraft  type,  only  data  from  similar  types 
might  be  included  in  order  to  improve  the  data's  consistency. 
If  the  aircraft  is  fairly  similar  to  a  number  of  previous 
models,  then  this  second  method  is  preferable.  Hence,  the 
decision  on  which  method  to  use  depends  on  the  purpose 
involved . 

This  paper  does  not  attempt  to  predict  the  cost  of 
any  particular  aircraft.  Instead,  its  purpose  is  to  test 
the  validity  of  a  specific  theoretical  technique  in 
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creating  CER's.  A  large  data  base  is  not  required  for 
that  purpose.  Therefore,  this  study  uses  only  fairly 
recent  information  on  fighter/ trainer  type  aircraft, 
which  is  the  same  data  used  by  Marcotte.  It  consists  of 
33  lots  on  9  aircraft  as  follows: 


A-4D  (3) 

F-105B,D  (4) 

F-4A,B  (4) 

F-102A  (4) 

F-106A, B  (5) 

T-38A  (3) 

F-104A,B,C  (3) 

A-  5  A ,  C  (4) 

F-111A,C,E  (3) 

All  cost  data  used  is  unclassified.  However,  it  is 
considered  privileged  information  and  therefore  is 
included  under  separate  cover. 

The  data  used  in  this  study  can  be  broken  down  into 
two  basic  categories.  The  first  category  is  all  infor¬ 
mation  pertaining  to  the  cost  elements.  The  second  is 
the  collection  of  characteristics  of  the  aircraft  them¬ 
selves  which  might  be  used  to  predict  values  for  the 
cost  elements  through  CER's.  Table  I  contains  the  values 
for  these  characteristics. 

Source 

All  cost  element  data  used  in  this  study  was  originally 
compiled  by  the  RAND  Corporation  in  support  of  document 
R-1693-PA&E  by  Large,  Campbell,  and  Cates  (Ref  4).  The 
cost  of  each  lot  is  adjusted  for  design  changes  to  improve 
the  consistency.  The  data  is  available  at  the  Cost 
Library,  Aeronautical  Systems  Division,  Wright-Patterson 
Air  Force  Base,  Ohio. 
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The  majority  of  the  aircraft  parameter  data  is  also 
taken  from  RAND  report  R-1693-PA&E  (Ref  4:13).  The  report 
also  contains  a  more  complete  explanation  of  the  terms  and 
a  discussion  regarding  the  usefulness  of  the  parameters 
in  deriving  CER's  (Ref  4:12-15).  Additional  information 
is  also  taken  from  aircraft  summary  sheets  at  the  ASD 
Cost  Library. 

Data  Reduction 

Cost  data  should  be  expressed  in  terms  of  man-hours 
of  labor  instead  of  dollars  whenever  possible  (Ref  4:15). 
The  use  of  actual  dollar  amounts  allows  several  possible 
sources  of  error  to  affect  the  data,  such  as  wage  rate 
variations  between  contractors  or  between  geographical 
locations.  Dollar  amounts  would  also  have  to  be  adjusted 
to  constant  year  dollars  through  price  indexes,  which  are 
never  completely  exact.  To  avoid  these  problems,  this 
study  uses  man-hours  instead  of  dollars  to  represent 
tooling,  engineering,  and  labor  costs. 

Manufacturing  material  costs  cannot  logically  be 
represented  ir.  terms  of  man-hours.  Instead,  the  actual 
dollar  amounts  must  be  used.  To  eliminate  the  effects 
of  inflation,  the  dollar  amounts  are  adjusted  to  constant 
1973  dollars  using  the  price  adjustment  indices  in  Table 
II  (ref  4:32) . 

The  data  for  each  of  the  cost  elements  can  be  divided 
into  recurring  and  non-recurring  amounts.  Previous  RAND 
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TABLE  II 


Price  Adjustment  Indices 
1973 


Year 

Material 

Equipment 

1952 

2.625 

2.972 

1953 

2.480 

2.808 

1954 

2.359 

2.656 

1955 

2.224 

2.506 

1956 

2.081 

2.353 

1957 

1.970 

2.226 

1958 

1.859 

1.078 

1959 

1.793 

1.981 

1960 

1.718 

1.892 

1961 

1.672 

1.833 

1962 

1.614 

1.756 

1963 

1.579 

1.696 

1964 

1.528 

1.632 

1965 

1.479 

1.568 

1966 

1.422 

1.496 

1967 

1.359 

1.422 

1968 

1.295 

1.343 

1969 

1.208 

1.249 

1970 

1.177 

1.188 

1971 

1.137 

1.138 

1972 

1.094 

1.081 

1973 

1.000 

1.000 

Corporation  studies  indicate  that  recurring  costs  only 
should  be  used  where  possible  (Ref  5:24).  In  some  cases 
however,  there  are  discrepancies  in  contractor  reported 
data  for  the  breakdown  of  tooling  and  engineering  costs 
into  recurring/non-recurring  amounts.  Therefore,  the 
tooling  and  engineering  hours  used  in  the  data  base  for 
this  study  represent  total  hours.  Labor  hours  and  material 
costs  represent  recurring  amounts  only. 

Cost  Element  Definitions 

Tooling  costs  refer  to  the  costs  of  producing  and 
using  tools  which  are  required  to  produce  a  particular 
model  of  aircraft.  Many  tools,  such  as  assembly  tools, 
dies,  jigs,  fixtures,  work  platforms,  and  test  equipment, 
can  be  used  only  on  one  model.  Thus  the  total  cost  of 
producing  these  tools  and  replacements  are  applied  to  the 
program.  More  general  purpose  tools,  such  as  milling 
machines,  presses,  routers,  and  lathes  are  considered 
capital  equipment.  An  allowance  for  depreciation  on 
these  machines  is  added  to  the  tooling  costs  for  the 
aircraft  model  (Ref  4:23). 

Engineering  costs  include  engineering  for  the  basic 
airframe  and  systems  engineering  work  done  by  the  prime 
contractor.  It  includes  engineering  for  design  studies 
and  integration,  wind-tunnel  models,  laboratory  testing, 
preparation  and  maintenance  of  drawings  and  specifications, 
and  reliability.  Only  those  hours  directly  attributable 
to  the  aircraft  itself  are  included  (Ref  4:13). 


Manufacturing  labor  costs  refer  to  all  direct  labor 
necessary  to  machine,  fabricate,  and  assemble  the  major 
aircraft  structure.  This  includes  the  direct  labor  portion 
of  components  that  are  built  off-site.  The  labor  necessary 
to  install  these  components  and  purchased  components  is 
also  included.  Man-hours  required  to  fabricate  the 
purchased  parts  and  materials  are  excluded  (Ref  4:27-28). 

Manufacturing  materials  include  raw  and  semifabricated 
materials  plus  purchased  parts  and  equipment  used  in  the 
construction  of  the  airframe.  Purchased  parts  are  general 
hardware  items  such  as  electrical  fittings,  valves,  and 
hydraulic  fixtures.  Purchased  equipment  is  other  general 
purpose  items  such  as  actuators,  motors,  generators, 
landing  gear,  instruments,  and  hydraulic  pumps.  Such 
equipment  may  be  purchased  by  the  contractor  or  provided 
by  the  government.  However,  if  a  piece  of  equipment  is 
specifically  designed  for  a  particular  aircraft,  it  is 
considered  subcontracted  and  excluded  from  manufacturing 
materials  (Ref  4:31). 
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III.  Mathematical  Development 


This  chapter  begins  with  brief  descriptions  or  ordinary 
least  squares  and  generalized  least  squares  procedures. 
Readers  who  desire  a  more  complete  description  should  refer 
to  Mendenhall  and  Scheaffer  (Ref  6:  Chap.  11)  or  Theil 
(Ref  7:  Chap.  3,6).  The  basic  model  used  in  this  paper  is 
then  presented  in  greater  detail  and  developed  into  a  form 
that  is  directly  applicable  to  the  purpose  of  deriving 
CER's.  An  understanding  of  matrix  algebra,  calculus,  and 
probability  theory  is  required  in  the  development.  Finally, 
the  computer  algorithm  used  to  compute  the  unknown  CER 
coefficients  i3  derived  from  the  mathematical  formulation. 

Notation 

Matrix  notation  is  used  throughout  the  remainder  of 
this  thesis.  Capital  letters  are  used  to  denote  matrices 
and  vectors.  Lower  case  letters  denote  scalars  and  elements 
of  matrices  and  vectors. 

An  apostrophe  following  a  capital  letter  (A’)  indicates 
the  transpose  of  the  matrix  or  vector.  The  inverse  of  a 
matrix  is  denoted  with  an  exponent  of  minus  one  (A  1 ) .  When 
the  elements  of  a  vector  are  averaged  and  that  average  is 
placed  in  each  position  of  a  new  vector  of  the  same  size, 
the  vector  of  averages  is  denoted  with  a  bar  over  the  symbol 
(A)  . 
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In  some  cases  an  approximation  or  estimate  of  some 
unknown  is  required.  The  estimate  or  matrix  of  estimates 
is  indicated  by  a  hat  over  the  symbol  (A) . 

The  determinant  of  a  matrix  is  denoted  by  |A| .  The 
notation  tr(A)  indicates  the  trace  of  the  matrix  A,  which 
is  the  sum  of  the  elements  on  the  main  diagonal.  The 
expression  <  ~>fx]  denotes  e  raised  to  the  power  x,  where  e 
is  the  base  of  the  natural  logarithm. 

Ordinary  Least  Squares 

The  ordinary  least  squares  procedure  starts  with  the 
equation  for  the  general  linear  model.  The  equation  is  of 
the  form 


Y  *  XB  +  e  (1) 

where  Y  is  the  vector  of  known  values  for  the  dependent 
variable  (e.g.  cost  element) ,  X  is  the  matrix  of  known 
values  for  the  independent,  predictor  variables  (e.g.  air¬ 
craft  parameters),  8  is  the  vector  of  unknown  coefficients, 
and  e  is  the  vector  of  error  terms. 

In  order  to  compute  values  for  the  unknown  coefficients, 
the  procedure  requires  the  following  assumptions  (Ref  7: 
110-111) : 

1.  The  values  of  the  independent  variables  (the 
columns  of  X)  are  linearly  independent. 

2.  The  error  term  values  are  random  and  follow  a 
normal  probability  distribution. 
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3.  For  any  values  in  X,  the  expected  value  of  the 

I 

error  term  is  zero  (E(e|X)  =  0). 

4.  The  variance-covariance  matrix  of  the  error  term 
is  given  by 

Var (e|X)  -  o2I  (2) 

2 

where  a  is  an  unknown  positive  number  and  I  is 
the  n  x  n  identity  matrix. 

With  these  assumptions,  application  of  the  maximum 
likelihood  estimation  technique  yields  the  following 
coefficient  estimates: 

£  »  (X'X)'1  X’Y  (3) 

Thus  the  estimates  are  found  directly  from  known  values 
of  the  independent  and  dependent  variables. 

Generalized  Least  Squares 

The  generalized  least  squares  technique  is  identical 
except  that  the  fourth  assumption  above  is  relaxed.  This 
technique  assumes  that  the  variance-covariance  matrix  of 
the  error  term  is  given  by 

Var  te|X)  »  a2V  (4) 

2 

where  a  is  an  unknown  positive  constant  and  V  is  a  known, 
symmetric,  positive-definite  matrix  whose  trace  equals  n 
(JRef  7:  237).  The  coefficients  estimates  then  become 

(.  6  -  tX'V^X)"1  X'V^Y  (5) 
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Model  Development 

The  basic  model  used  in  this  study  is  a  random  coef¬ 
ficients  model  in  which  some  of  the  coefficients  are  assumed 
to  be  the  dependent  variables  in  a  second  linear  regression. 
The  source  of  this  structure  is  an  article  by  Amemiya 
(Ref  1:793) .  In  its  most  general  form  the  model  can  be 
defined  by  the  equations 


Y  «* 

Po 

+  ZB  +  U 

(6) 

B  = 

Xy 

+  V 

(7) 

where  the  vector  Y  and  matrices  P,  Z,  and  X  consist  of 
known  constants ,  a  and  Y  are  vectors  of  unknown  parameters , 

B  is  a  vector  of  random  coefficients,  and  U  and  V  are  random 
error  vectors. 

As  in  other  models,  the  solution  requires  certain 
assumptions : 

1.  The  error  terms  U  and  V  are  uncorrelated  with  each 
other. 

2.  The  error  terms  follow  normal  probability  distri¬ 
butions  with  zero  means. 

3.  The  variance-covariance  matrices  are  given  by 


Var(U)  -  ojl 

(8) 

Var(V)  «  o*l 

(9) 

2  2 

where  and  o‘  are  unknown  positive  parameters  and 
the  I’s  are  identity  matrices  of  appropriate  size. 


4.  Y  is  observed  but  0  is  not. 

5.  The  matrices  X  and  (P,Z)  are  full-rank. 

equations  6  and  7  yields 

Combining 

Y  »  Pa  +  ZXy  +  ZV  +  0 

(10) 

Some  additional  definitions  are  useful  in  deriving  a  solu¬ 
tion  to  this  combined  equation: 

G  -  (P,ZX] 

(11) 

•  -  [?] 

(12) 

e  -  zv  +  u 

(13) 

F  «  Y  -  Gp 

(14) 

Note  that  G  consists  of  known  constants,  p  contains  the 

unknown  coefficients,  and  e  contains  both  error  terms.  With 

these  new  definitions,  equation  10  becomes 

Y  *  Gp  +  e 

(15) 

which  now  appears  to  be  in  standard  linear  form. 

error  term,  e,  has  a  variance-covariance  matrix 

The  new 

defined  by 

Z  -  VarCe)  *  crj  ZZ'  +  o*I 

(16) 

which  is  both  positive  definite  and  symmetric. 

Thus,  there  are  two  sets  of  unknowns:  the  coefficients 
in  p  and  the  variances  in  I.  If  either  one  were  known,  it 
would  be  possible  to  derive  a  maximum  likelihood  estimate 
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for  the  other.  Therefore,  this  study  takes  the  following 
approach : 

1.  Start  with  an  initial  guess  for  p  (p) . 

2.  Using  this  p  find  I  which  maximizes  the  likelihood 
function  of  £  given  p  (L(£|p)). 

A  A 

3.  Use  £  to  find  p  which  maximizes  the  likelihood 
function  of  p  given  E  (L(p|£)). 

4.  Repeat  steps  two  and  three  until  the  sequence 
converges . 

This  approach  should  produce  maximum  likelihood  estimates 
for  p  and  I.  It  remains  to  develop  specific  procedures  to 
maximize  the  appropriate  likelihood  functions. 

When  I  is  assumed  to  be  known,  the  model  and  assump¬ 
tions  meet  all  the  requirements  of  generalized  least 
squares.  The  solution  which  maximizes  L(p|l)  is  therefore 

p  -  (G' f-1G) _1  G,r”1Y  (17) 

To  begin  the  estimation  of  I ,  note  that  both  error 
terms  (U  and  V)  are  assumed  to  follow  normal  probability 
distributions.  Therefore,  the  likelihood  function  is  given 
by 

L  (E  |  p)  -  (1/27T)  n|  r  |  "1/2  exp[  (-l^jF'E^FJ  (18) 

where  n  is  the  number  of  observations  in  the  sample.  Since 
p  is  given,  F  is  known.  The  only  unknowns  are  the  variances, 
ou  and  o‘,  which  are  included  in  E.  They  can  be  found  by 
solving  the  simultaneous  equations 
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3L(Z|p)/3o*  -  0  (19) 

3L(Z|p)/3aJ  -  0  (20) 

<<s 

which  then  yield  the  maximum  likelihood  estimate  Z. 

Some  intermediate  results  are  provided  first.  Theil 
(Ref  7:33)  shows  that 

dA'Vax  -  -A"1(3A/3x)A"1  (21) 

where  A  is  a  function  of  x.  This  together  with  equation  16 
gives 

az'VaoJ  -  -z”1oz/3cJ)z"1  •  -z"1z"1  (22) 

az'VaaJ  -  -z“1oz/acJ)z“1  -  -z"1zz,z_1  (23) 

Graybill  (Ref  2:266)  proves  that 

3|A|/3x  -  tr( |A|A”1C3A,/ax) ]  (24) 

where  A  again  is  a  function  of  x.  Since  Z  la  symmetric/ 
so  is  Z*1 ,  thus 

d| Z"1|/3a^  -  tr{|Z“1|Z(3Z’1/acJ)]  (25) 

a|Z’1|/3cJ  -  tr[|Z_1|ZtaE’1/acJ)J  (26) 

Combining  these  equations  with  22  and  23  gives 

a | Z”1|/3a^  -  tr(-| Z”1! Z”1)  (27) 

a  I Z*1 1 /3aJ  -  tr(-|z"1|zz,z“1]  (28) 
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22 


c 


lyiiEl  .  (l/2ir)n(-l/2) \Z\~1/2  exp[  {-1/2)F* E-1F]  trUT1) 

L 


+  F'  F 

wwn  •* 


(35) 


Since  we  are  seeking  the  maximum  likelihood  point,  the 
derivative  is  now  set  equal  to  zero.  None  of  the  terms 
which  factored  out  can  possibly  equal  zero,  so  they  drop 
out  of  equation  35,  leaving 

.-1 


trU"1)  +  F 


,  U 


8c 


(36) 


u 


Using  equation  22  once  more  now  yields 

tr (I-1)  -  F' r-1E-1F  -  0 


(37) 


2 

The  analysis  for  the  derivative  with  respect  to  cy  is 


virtually  identical  and  yields 


tr (zz 1 1"*1)  -  F,s“1zz,r"1r  -  o 


(38) 


(J 


We  now  have  two  equations  (37  and  36)  with  which  to  solve 

2  2 

for  the  two  unknowns  (o‘  and  a*) . 

Some  additional  definitions  become  useful  at  this  point. 
Define 


4  - 


U/c*)I 


(39) 

(40) 


which  combine  to  give 


H  -  «ZZ'  +  I 


(41) 
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Also  note  that 


E"1  =  (l/oJ|)  H"1  (42) 

Combining  42  and  37  gives 

(l/oJjtrdT1)  -  (l/oJ^F'H”1!!' XF  *  0  (43) 

2 

which  can  be  solved  for  o‘  yielding 

0*  -  F,H**1H*"1F/tr  (H_1)  (44) 

Similarly,  equations  42  and  38  yield 

aj  «*  F,H"LZZ'H~1F/tr  (ZZ'H-1)  (45) 

Equating  44  and  45  now  gives 

F,H"1ZZ,H~1F/tr(ZZ'H*'1)  «  F ' H”^H”1F/tr  (H~*)  (46) 


which  reduces  to 


,  H^SZ'h"*  _  H~1H~11 

.trCZZ'H"1)  tr (H"1-)  J 


Note  that  H  and  hence  H~  are  functions  of  only  one 
unknown  variable  6.  With  one  equation  and  one  unknown  it 
should  now  be  fairly  simple  to  find  the  root  5,  then  use  it 
in  equations  44  and  39  to  find  the  values  for  c*  and  a‘. 

The  matrix  H  is  of  size  n  x  n.  If  n  is  large,  invert¬ 
ing  H  as  required  above  could  introduce  significant  error. 
Due  to  the  nature  of  this  particular  study,  that  problem 
can  be  eliminated  through  some  additional  analysis. 
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The  matrix  Z  in  this  study  is  of  the  form 


Z11 

Z21 


0  0  . 

0  0  . 


0 


0 


“31 


0 


0 


0 


Z12 


0 


0 


0 

0 


Z 


•32 


0 


(48) 


The  following  section  explains  the  correspondence  between 
the  equations  from  Chapter  I  and  this  general  model. 

Matrix  Z  contains  k  columns,  one  for  each  aircraft  type. 
Each  row  represents  one  lot  purchase,  which  is  made  on  only 
one  particular  type  of  aircraft.  Thus,  7,  contains  a  zero 
when  the  aircraft  type  for  the  row  disagrees  with  the  type 
for  the  column. 

Each  column  of  Z  contains  a  certain  number  of  non-zero 
elements  depending  on  the  number  of  lots  for  that  aircraft 
type  included  in  the  data  base.  Define  as  the  vector 
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of  non-zero  elements  In  the  i'th  column.  The  full  matrix 
Z  can  now  be  written  as 


with  all  the  blanks  being  2eros.  Also,  ZZ*  becomes 


ZlZi 


Z-.Z* 


Z3Z3 


Vi 


which  is  an  n  x  n  matrix  containing  the  square  blocks  Z^Z| 
along  the  main  diagonal  and  zeros  elsewhere. 

Define  as 


Hi  «  6ZiZi  + 


where  is  the  identity  matrix  of  the  same  size  as  Z^Z^. 
Note  that  H  is  now  an  n  x  n  matrix  with  square  blocks  on 
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the  diagonal  and  zeros  elsewhere.  A  theorem  by  Graybill 
(Ref  2:170)  provides 


( 


1L  -  5(1  + 


* E  i  2  .-1 
6Ij=lzji) 


ZiZi 


(52) 


where  Z  denotes  a  summation  here  and  n^  is  the  number  of 
elements  in  Z^.  Define 


t. 

i 


so  that 


-1 


becomes 


2 

ji 


ZiZi 


(53) 


H"1  -  1L  -  6ZiZ^/(l  +  <Sti) 


(54) 


Finally,  H_1  is  the  matrix  with  square  blocks  H?1  on  the 
diagonal  and  zeros  elsewhere  (Ref  2:164-165) .  Hence  we  can 
compute  H-1  directly  from  Z  and  6  to  minimize  the  computer 
roundoff  error. 

The  trace  of  H-^  can  also  be  found  easily: 


tr (H_1)  *  Ejol  tr(H"X)  (55) 

trCH"1)  =  l*ml  [nj,  -  6  tr(ZiZ|/(l  +  6t±) )  1  (56) 

The  n^'s  sum  to  n  and 

tr(Zizp  -  tL  (57) 

so  that 

tr(H-1)  -  n  -  rj. x  6tt/(l  +  6ti)  (58) 

tr (H*1)  -  n  -  k  +  l*ml  (1/(1  +  6ti)J  (59) 
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This  completes  the  theoretical  development  of  the  general 
model . 

Computer  Algorithm  Development 

Chapter  I  stated  that  the  equations  to  be  used  in  this 
study  are 

y  <=  aWbScQ^1Q22  .  .  .  Q^U  (60) 

and 


D  -  XR  +  V  (61) 

By  taking  the  natural  logarithm  of  both  sides  of  equation  60 
it  becomes 

In  Y  *  In  a  +  b  In  W  +  c  In  S  +  I^^d^  ln  Qi  +  In  U 

(62) 

With  appropriate  definitions  equations  62  and  61  can  now  be 
put  in  the  form  of  the  general  model  of  equations  6  and  7. 
Define  a  new  vector  Y  as  the  vector  of  the  elements  of  ln  Y 
in  equation  62.  Define 

P  -  [J,  ln  W,  ln  SJ  (63) 

where  J  is  a  vector  of  all  ones.  Define 


Z  »  [ln  Q1#  ln  Q2,  ...,  ln  QkJ 


a 


'in  a" 
b 


(64) 


(65) 


c 


8 


D  *  [d^,  ^2 ,  •  «., 


(66) 


dkJ’ 


Y  -  R 


(67) 


Lastly,  let  the  new  vector  U  contain  the  elements  of  In  U 
from  equation  62.  The  transformation  is  now  complete. 

The  final  step  is  to  properly  sequence  the  theoretical 
steps  for  use  by  the  computer.  Since  equation  47  is  not 
easily  differentiable/  the  secant  method  (Ref  3:70-71)  is 
used  in  this  study  to  find  its  root.  The  outline  of  the 
required  steps  is  therefore  as  follows : 

1.  Read  in  all  data/  including  an  initial  guess  p 
and  initial  values  6^  and  6^  which  are  required  to 
begin  the  secant  method  iteration. 

2.  Using  and  &2,  compute  H-1,  tr  H_1,  and 

F' (H"1ZZ,H"1/tr(ZZ,H_1)  -  H~1H~1/tr (H_1) ]F  which 
hereafter  is  called  F(6). 

3.  Use  the  results  of  step  2  according  to  the  secant 
method  to  find  a  new  approximation  for  the  root. 

4.  Repeat  steps  2  and  3  until  the  root  is  found  to 
the  desired  degree  of  accuracy. 

5.  Once  the  root  is  found,  use  it  to  compute 

2  2 

then  ou  and  ay  from  equations  45  and  39. 

6.  Calculate  £  from  equation  16. 

✓N 

7.  Calculate  the  new  p  from  equation  17. 

8.  Recompute  F  from  equation  14  and  find  the  new  F(<5) 
from  47.  If  it  is  not  sufficiently  close  to  zero, 
return  to  step  2. 
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9.  When  F(6)  remains  close  to  zero  using  the  new 

values  in  F ,  the  process  is  considered  to  be  con¬ 
verged  . 

10.  Output  all  desired  numbers/  including  the  maximum 
likelihood  coefficients  (p)  and  the  residuals  (F) . 
The  entire  procedure  is  thus  completed  and  the  model  solved 
for  the  desired  coefficients. 


i 

\ 
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IV.  Results  and  Analysis 


This  chapter  provides  some  additional  analysis  of  the 
overall  procedure/  then  presents  and  analyzes  the  results. 

It  begins  by  explaining  exactly  what  numbers  were  produced 
and  how  they  car  be  compared.  A  few  important  aspects  of 
the  computer  algorithm  are  then  discussed  in  greater  detail. 
The  following  sections  present  and  discuss  the  results  for 
each  of  the  four  cost  elements.  Finally,  some  of  the 
residuals  are  analyzed  to  determine  whether  or  not  this 
procedure  actually  solved  the  original  problem  of  non- 
randomness. 

Notation 

Table  III  presents  some  additional  definitions  which 
are  useful  in  discussing  the  CER  results  and  presenting  the 
final  equations. 

CER  Development 

The  computer  program  was  run  until  convergence  for  each 
of  the  four  cost  elements  using  each  of  the  nine  available 
explanatory  variables  individually  in  equation  61.  For 
comparison,  it  was  also  run  with  a  constant  term  only  in 
equation  61  by  letting  X  be  a  vector  of  ones.  The  results 
are  presented  in  Tables  IV-VII. 

It  should  be  noted  that  the  statistical  significance 
of  the  coefficients  is  not  presented  in  this  study.  That 
is  because  the  model  cannot  be  put  into  standard  linear 
form,  and  no  known  statistical  theory  exists  at  this  time 
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TABLE  III 
Definitions 


Acronym 

Definition 

AR 

Aspect  Ratio 

CEIL 

Ceiling 

CR 

Climb  Rate 

EH 

Engineering  Hours 

L/D 

Lift/Drag  Ratio 

LP 

Load  Factor 

LH 

Labor  Hours 

MC 

Material  Costs 

Q 

Cumulative  Lot  Quantity 

S 

Speed 

ST 

Static  Thrust 

TH 

Tooling  Hours 

T/W 

Thrust/Weight  Ratio 

W 

Weight 

WA 

Wing  Area 

WL 

Wing  Load 

which  could  be  used  to  calculate  the  significance  levels. 

2 

Therefore,  this  study  uses  the  R  values,  which  do  not 
require  statistical  tests,  to  compare  the  different  CER's. 

Computer  Algorithm  Analysis 

An  example  of  the  computer  program  is  given  in  Appen¬ 
dix  A.  During  the  program's  use,  some  interesting  aspects 
of  the  procedure  were  discovered. 

First,  the  value  of  6  which  satisfied  F(5)=0  (equation 
47)  on  the  first  iteration  was  very  sensitive  to  the  initial 

a* 

guess  for  p.  The  reason  was  that  the  parameters  in  X  were 
generally  much  larger  in  magnitude  than  the  coefficients 
in  D.  Thus,  even  small  errors  in  the  initial  guesses 
of  the  coefficients  of  X  had  a  very  significant  impact. 
Another  impact  of  the  difference  in  magnitudes  was  that  the 
final  values  in  y  were  sometimes  very  small.  For  this  data, 
then,  a  small  number  in  y  did  not  necessarily  mean  that  the 
associated  parameters  were  insignificant. 

The  sensitivity  of  6  to  p  did  not  create  a  serious 
problem.  Even  when  the  first  iteration  estimated  6  only 
roughly,  by  the  second  iteration  it  was  relatively  close  to 
its  final  value.  The  main  difficulty  was  the  fact  that  the 
secant  method,  which  was  used  to  converge  on  5,  also  required 

A  A 

initial  guesses  <5^  and  Sj*  Since  6  changed  so  much  at  the 
second  iteration,  the  program  was  built  to  allow  different 
initial  guesses  to  be  used  at  that  time. 

A  second  aspect  of  the  algorithm  was  that  the  initial 
guesses  5^  and  6^  had  to  be  chosen  carefully.  If  they  were 
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too  snail  the  secant  method  converged  slowly  and  a  lot  of 
computer  time  was  wasted.  But  because  of  the  shape  of  the 
function  F(6)  (Figure  2),  they  could  not  be  chosen  too 
large  either.  If  they  were  too  large  the  sequence  of  esti- 
mates  oscillated  between  the  positive  and  negative  sides  of 
the  function  and  diverged  away  from  the  root.  Thus#  a  cer¬ 
tain  amount  of  trial  and  error  was  r*  juired  to  find  the 
best  range  of  values  to  use. 

The  third  aspect  of  the  algorithm  concerned  the  con¬ 
vergence  criterion.  The  process  was  considered  converged 
when  F (6 )  was  within  a  desired  epsilon  of  zero  using  the 
new  values  for  the  vector  F  and  the  matrix  H~*.  This  study 
used  an  epsilon  of  10  .  This  value,  however,  did  not  guarantee 

that  the  final  coefficients  in  p  would  also  be  accurate  to 
six  significant  figures.  In  fact,  most  of  the  values  agreed 
to  only  four  or  five  digits  on  the  last  two  iterations. 

For  the  purpose  of  this  report,  that  degree  of  accuracy  was 
felt  to  be  sufficient.  The  actual  numbers  presented  in  the 
tables  are  given  to  the  number  of  digits  believed  to  be 
significant. 

Results 

This  section  presents  the  CER  results  for  each  cost 
element.  Some  significant  features  of  the  results  are  then 
discussed.  The  best  CER's,  as  indicated  by  this  study,  are 
presented  in  equation  form. 

The  results  for  the  tooling  hours  CER's  are  given  in 
Table  IV.  The  coefficients  for  the  CER  with  a  constant 
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term  only  in  X  differ  from  the  coefficients  found  in  previ¬ 
ous  studies  (Ref  5 >42)  but  are  generally  in  the  same  range 

2 

as  shown  in  Table  VIII.  The  R  values  average  about  .85/ 
which  is  the  lowest  of  all  the  cost  elements.  The  tooling 
hours  CER's  were  also  the  slowest  to  converge  during  the 
computer  runs. 

Of  special  note  is  that  some  of  the  R  values  are  lower 
with  a  parameter  included  in  X  than  for  the  constant-only 
CER.  The  only  possible  reason  for  this  lies  in  the  formu¬ 
lation  of  the  model.  Recalling  equation  10 

Y  -  Pa  +  ZXy  +  ZV  +  U  (10) 

we  can  see  that  X  does  not  enter  the  model  as  an  additive 

term.  Instead/  it  is  multiplied  by  Z  to  give  a  completely 

new  matrix  of  known  values.  Thus,  each  time  X  changes 

we're  solving  a  different  model/  so  it  is  quite  possible 
2 

for  R  to  decrease. 

The  CER  results  for  engineering  hours  are  given  in 

Table  V.  The  coefficients  for  the  constant-only  CER  agree 

very  closely  with  those  from  previous  studies  (Ref  5 >39)  as 

2 

seen  in  Table  VIII.  The  R  values  are  quite  high,  and  the 

computer  program  converged  much  faster  than  for  the  tooling 

2 

hours  CER's.  Note/  however ,  that  R  varies  very  little,  so 
for  prediction  purposes  no  particular  CER  would  be  much 
better  than  the  others. 
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Table  VI  presents  the  results  of  the  material  costs 

CER's.  As  with  tooling  hours,  the  coefficients  in  the 

constant-only  CER  do  show  some  difference  from  coefficients 

from  other  reports  (Ref  5:46).  The  R  values  are  moderately 

good,  as  was  the  convergence  rate  on  the  computer  runs.  A 

2 

few  of  the  CER's  do  show  significant  improvement  in  R  over 
the  constant-only  form. 

The  labor  hours  CER  results  are  shown  in  Table  VII.  As 

with  engineering  hours,  the  constant-only  coefficients  are 

quite  close  to  those  from  previous  studies  (Ref  5:44).  The 

convergence  rate  was  very  good,  and  the  R  values  are  quite 

2 

high.  One  CER  shows  a  substantial  improvement  in  R  over 
the  constant-only  value. 

2 

The  specific  CER's  which  give  the  highest  R  values  for 
each  cost  element  are  the  following: 


(. 


TH 

D 

EH 

D 

MC 

D 

LH 

D 


e9. 75236  w. 562233  g-.1384  Qdi 

.258238  +  (.0002040414) (WA) ; 
e. 07883  wl. 19191  g. 45729  Qdi 

.077030  +  (.01439) (LF) ; 

6. 1297  u. 801813  c. 13605  di 
e  w  b 

.434488  +  (.0289438) (L/D) ; 
e6.8917  wl. 18411  s-. 674521  Qdi 

.120814  +  (.0439328) (LF) 


(68) 

(69) 

(70) 

(71) 

(72) 

(73) 

(74) 

(75) 


39 


n 

« 

4J 

<D  — 

Soi* 

k 

«3 


4J 

S- 

4->  <H 
«<?- 
C 

o 

o 


■a 

0)  hi 
1KO 

a 

co  u 


•U  — 
£  CM 
&KO 
•HI 
QJ  II 

s 

O4  —  -Q 
C 


k 

4-1  0) 

<44  4-1 
<0  fy  X 
k  S 

O  to  c 

<  04 


© 

r- 

CM 

© 

co 

© 

rH 

© 

© 

n 

HI 

in 

© 

CM 

in 

© 

CM 

© 

© 

(M 

VO 

r- 

© 

HI 

© 

i—l 

CM 

© 

© 

H 

H 

m 

rH 

«*■ 

© 

© 

** 

© 

rH 

H 

H 

CM 

•H 

»H 

CM 

CO 

rH 

rH 

© 

• 

© 

• 

© 

• 

© 

• 

© 

• 

o\ 

• 

© 

1 

© 

• 

© 

• 

© 

• 

HI 

© 

rH 

f" 

© 

HI 

© 

H) 

© 

rH 

co 

(N 

r 

© 

rH 

CO 

© 

rH 

r- 

HI 

H 

© 

VO 

co 

00 

rH 

1 

O 

© 

© 

in 

rH 

© 

vo 

O 

© 

a\ 

o 

CO 

o 

CT\ 

© 

© 

O 

© 

o 

o 

rH 

o 

© 

© 

CM 

o 

© 

o 

o 

CM 

o 

CM 

o 

CO 

o 

© 

© 

o 

O 

© 

O 

o 

o 

r 

r 

r 

T 

ro 

H 

m 

r- 

CM 

© 

co 

00 

o 

r- 

o 

o 

© 

vo 

© 

CM 

© 

© 

00 

o 

CM 

o 

rH 

CM 

CM 

© 

CM 

r~ 

ro 

r- 

VO 

vO 

rH 

•4 

© 

© 

© 

© 

<N 

© 

O 

© 

CO 

r* 

o 

r~ 

• 

© 

m 

00 

• 

r- 

• 

r- 

• 

• 

vo 

* 

*? 

• 

© 

© 

• 

o 

CO 

r- 

© 

o 

ro 

CM 

© 

© 

o 

CM 

IT) 

in 

VO 

CM 

CM 

CO 

o 

© 

© 

in 

vo 

CD 

r* 

HI 

CO 

n 

© 

© 

rH 

r* 

r- 

VO 

rH 

m 

00 

co 

© 

© 

rH 

HI 

(N 

CM 

rH 

o 

o 

rH 

o 

rH 

i 

l’ 

r 

i 

i 

r 

r 

r 

xr 

CD 

© 

© 

ro 

rH 

'd1 

CM 

VO 

in 

•4 

rH 

© 

cn 

in 

r« 

i-l 

rH 

rH 

H 

© 

rH 

© 

ao 

r- 

CM 

© 

in 

H) 

rH 

© 

© 

CM 

vo 

© 

© 

r- 

o 

CM 

© 

CM 

CM 

O 

o 

00 

© 

© 

© 

© 

© 

© 

O 

rH 

H 

rH 

rH 

H 

o 

IN 

cm 

!**• 

o 

IM 

to 

CM 

«d* 

© 

rH 

m 

© 

m 

r- 

© 

© 

CM 

© 

© 

CM 

HI 

© 

© 

© 

CM 

rH 

CM 

m 

o 

CM 

«#• 

CM 

rH 

r- 

© 

CO 

r* 

r* 

© 

© 

hi 

© 

© 

© 

0) 

c 

H 

o 

s 

0 

W 

as 

«< 

« 

fr> 

\ 

&4 

\ 

c 

O 

U 

s 

* 

3 

M 

3 

H 

f 


V 

t 


?  l 

i 

> 

f 

| 


! 

I 

i 

'  \ 

*  i 


i 

1 

t 


f 


41 


*0 
0) 
Q> 
Ou 
m  co 

P 

c  c 

0)  rH 


<P  -v 

S  £ 

(J  Cn 


>  « 

W 

a  ° 

g  ° 

e 

o 

09 


ov 

CM 

in 

in 

VO 

CO 

vo 

CM 

CM 

CO 

• 

<*> 

• 

• 

rH 

o 

vo 

00 

© 

vo 

o 

CM 

o 

rH 

VO 

1 

i 

1 

in 

in 

ov 

in 

vo 

rH 

m 

vo 

ov 

r- 

• 

in 

• 

r~ 

• 

CM 

CO 

co 

ov 

o 

00 

o 

CM 

rH 

VO 

m 

• 

• 

• 

r- 

ov 

10 

vO 

Ov  vo 
r-  r» 


r~  m 

<r  oo 


CO  CM 
O  iH 


o 

CM 

in 

OV 

t" 

CO 

OV 

VO 

t". 

MT 

• 

r- 

• 

c* 

• 

OV 

in 

rM 

in 

CM 

00 

r- 

O 

CM 

rH 

c* 

rH 

1 

CO 

o 

'S’ 

00 

c- 

r* 

CM 

CO 

o 

vo 

© 

rH 

rH 

CM 

CO 

CM 

OV 

o 

r- 

OV 

ov 

rH 

• 

r- 

• 

o 

• 

CO 

CM 

r- 

CM 

o 

in 

rH 

rH 

co 

VO 

• 

VO 

• 

vo 

• 

CM 

r~ 

in 

o 

VO 

o 

o 

in 

r- 

HJ1 

MT 

rH 

rH 

1 

1 

1 

OV 

CO 

CM 

OV 

OV 

c- 

o 

o 

rH 

• 

1 — l 
• 

rH 

• 

CO  LT) 

in  ov 
ov  in 


P  C  W 

X!  O  O  C 


C  P  B  J= 

H  «  -H  -H 

K  3E  H  Eh 


T3 

>o 

C 

e 

a>  <3  >i 

0>  <0  >i 

P  X 

p  .* 

pen 

pen 

x:  o  o  e 

x:  o  o  c 

o  o  n  <9 

o  o  n  <9 

c  u  g  A 

c  u  e  x: 

•rt  «J  -H  -H 

•H  <0  *H  «H 

»  S  fM  Eh 

33  X  Eh  Eh 

•pcco 

x:  o  o  e 

o  o  n  3 

e  u  s  a 

•H  flj  -H  *H 


Analysis  of  Residuals 

The  main  purpose  for  applying  a  random  coefficients 
model  to  the  airframe  cost  data  was  that  the  residuals  gen¬ 
erated  by  Marcotte,  using  a  more  standard  model/  did  not 
appear  to  be  truly  random.  One  final  question,  then,  needs 
to  be  answered:  did  the  use  of  this  random  coefficients 
model  solve  the  problem? 

Several  sets  of  residuals  were  examined  in  trying  to 
answer  this  question.  The  results  of  the  analysis  of  the 
labor  hours  residuals  using  load  factor  as  the  aircraft 
parameter  in  matrix  X  are  presented  here  for  illustration. 

Figure  3  shows  a  plot  of  these  residuals  against  the 
natural  logarithm  of  the  cumulative  quantity  for  each  lot. 

A  different  constant  for  each  type  of  aircraft  has  been 
added  on  the  horizontal  scale  to  divide  the  point3  by  air¬ 
craft  type.  Note  that  within  each  aircraft  type  there  still 
appears  to  be  a  significant  slope  to  the  residuals. 

These  residuals  are  the  values  of  e  observed  with  this 
data.  According  to  equation  13 

e  -  ZV  +  U  (13) 

and  it  is  V  and  U  that  are  assumed  to  be  random  with  this 
model,  not  c.  Since  we  have  assumed  that  the  variance- 
covariance  matrix  of  U  is  given  by  cuI,  equation  13  is  in 
the  form  for  ordinary  least  squares.  We  can  therefore  esti¬ 
mate  V  by 
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Figure  3.  Labor  hours  residuals  (ord.)  vs.  natural  log  of  quantity  (abs. 


V 


2'e 


(76) 


-  (Z'Z)"1 

and  solve  for  U  using 

6  =  e  -  ZV  (77) 

Figures  4  and  5  show  the  plots  of  $  and  U,  respectively. 

A 

Although  V  appears  to  be  fairly  random,  there  is  still  a  very 
significant  slope  to  the  0  residuals  in  many  cases.  The 
model  assumptions  are  still  being  violated. 

In  equation  13,  Z  does  not  contain  any  intercept  term. 

A  A 

By  adding  the  intercept  we  can  resolve  for  V  and  U  and 
obtain  different  results.  Figures  6  and  7  show  the  new 
plots  of  V  and  U.  In  this  last  case  there  no  longer  appears 
to  be  any  significant  slope  to  any  of  the  residuals,  which 
indicates  that  the  assumptions  are  finally  satisfied. 
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Figure  5.  U  without  intercept  vs.  natural  log  of  quern  tit y 


V.  Summary#  Conclusions#  and  ftocognandatlona 

( 

This  chapter  summarises  the  reasons  for  performing 
this  study  and  rsvisws  the  work  done.  Some  conclusions 
about  the  results  sre  then  presented  along  with  soma  ideas 
about  how  the  method  could  be  applied.  Finally ,  several 
recommendations  for  further  study  are  disoueeed. 

lams* 

The  field  of  coat  estimation  grow  out  of  the  desire 
of  decision  makers  to  have  more  accurate  ways  to  predict 
weapon  system  costs.  Aircraft  airframes#  in  particular# 
required  increasingly  large  amounts  of  money  to  buy.  A 
number  of  reports  have  been  written  to  date  to  try  to 
^  find  better  ways  of  generating  Gift's  for  airframes.  A 

recent  report  by  Marootte  (ftsf  3)  yielded  very  high 
values  for  ll  #  but  also  revealed  that  some  of  the  results 
did  not  satisfy  the  statistical  assumptions  about  random¬ 
ness  of  the  error  term.  That  assumption  was  required  in 
order  to  derive  confidence  intervals  fer  the  estimated 
values.  its  violation  allowed  additional#  unexplained 
sources  of  error  to  affect  the  confidence  limits. 

To  eliminate  this  potential  problem#  this  study 
applied  a  random  coefficients  model  to  the  same  data, 
floma  theoretical  development  was  required  to  transform 
the  baslo  model  into  a  form  that  could  be  applied  directly 
to  the  data.  Once  the  theoretical  work  was  completed#  it 
(  was  used  to  oreate  a  computer  algorithm  that  could  produce 
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ths  desired  CER  coefficient*.  The  algorithm  was  then  run 

2 

using  the  available  aircraft  data  and  an  R  value  was 

2 

oalculated  for  each  CER.  The  CER  with  the  highest  R 
value  for  each  cost  element  was  then  presented  in  equation 
form.  This  study,  then,  recommends  specific  equations 
for  use  in  airframe  cost  estimation. 

Conclusions 

The  coefficients  generated  with  only  a  constant  term 
in  matrix  X  are  reasonably  close  to  the  coefficients  pro¬ 
duced  by  previous  studies.  This  agreement  improves  the 
confidence  in  the  method  used  in  this  report.  In  most 
oases,  a  substantial  improvement  in  the  R2  values  was 
obtained  in  the  recommended  CER.  Furthermore,  the  model 
was  finally  able  to  eliminate  the  unexplained  slope  in  the 
residuals,  so  that  all  the  model  assumptions  were  satisfied. 
For  the  above  reasons,  this  study  concludes  that  the  random 
coefficients  modal  is  indeed  viable  in  estimating  air¬ 
frame  costs. 

One  other  use  of  this  model  is  also  possible.  After 
two  lot  purchases  of  a  particular  aircraft,  it  is  possible 
to  estimate  the  error  term  V  for  that  aircraft.  For 
future  purchases,  this  estimate  can  then  be  used  to 
further  reduce  the  error  in  the  predicted  costs  because 
part  of  the  error  term,  ZV  +  u,  has  been  estimated.  This 
type  of  Improvement  on  cost  predictions  is  not  available 
with  some  other  model  types. 
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Recommendations  for  Further  Study 

The  model  as  presented  can  provide  substantial  improve¬ 
ments  in  estimating  airframe  costs.  However,  a  few  aspects 
of  the  model  are  as  yet  incomplete,  and  the  model  could  be 
improved  through  further  study  of  these  aspects.  This  sec¬ 
tion  gives  several  possible  areas  for  further  study. 

First,  it  should  be  possible  to  determine  a  probability 
distribution  for  the  overall  error  term,  e.  Given  that  dis¬ 
tribution,  it  would  then  be  possible  to  apply  statistical 
hypothesis  tests  to  the  results  in  Chapter  IV.  Confidence 
intervals  for  the  coefficients  and  cost  predictions  could 
also  be  obtained.  These  statistical  tests  would  allow  a 
more  complete  comparison  of  this  model  with  others  previously 
developed. 

Second,  this  study  imposed  the  restriction  of  only  one 

aircraft  parameter  appearing  in  matrix  X.  The  theory  is 

fully  compatible  with  more  than  one  parameter  being  included 

2 

in  X.  These  additional  parameters  could  increase  the  R 
values,  and  if  found  to  be  statistically  significant  they 
could  further  enhance  the  model's  usefulness. 

Third,  it  should  be  noted  that  in  order  to  eliminate 
the  obvious  slopes  in  the  residuals,  it  was  necessary  to 
include  an  intercept  term  in  the  Z  matrix.  The  theory 
developed  in  this  study  cannot  explain  these  intercepts. 
Additional  work  could  be  done  regarding  their  source,  pre¬ 
diction,  and  possible  uses. 
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The  goal  of  all  of  these  research  areas  is  to  produce 
better  CER's.  By  providing  better  cost  estimates  to  the 
decision  maker,  he  can  thus  make  better  decisions  and 
improve  the  overall  military  efficiency. 
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